A Ground Truth For Half A Million Musical Incipits
نویسندگان
چکیده
Musical incipits are short extracts of scores, taken from the beginning. The RISM A/II collection [6] contains about half a million of them. This large collection size makes a ground truth very interesting for the development of music retrieval methods, but at the same time makes it very difficult to establish one. Human experts cannot be expected to sift through half a million melodies to find the best matches for a given query. For 11 queries, we filtered the collection so that about 50 candidates per query were left, which we then presented to 35 human experts for a final ranking. We present our filtering methods, the experiment design, and the resulting ground truth. To obtain ground truths, we ordered the incipits by the median ranks assigned to them by the human experts. For every incipit, we used the Wilcoxon rank sum test to compare the list of ranks assigned to it with the lists of ranks assigned to its predecessors. As a result, we know which rank differences are statistically significant, which gives us groups of incipits whose correct ranking we know. This ground truth can be used for evaluating music information retrieval systems. A good retrieval system should order the incipits in a way that the order of the groups we identified is not violated, and it should include all high-ranking melodies that we found. It might, however, find additional good matches since our filtering process is not guaranteed to be perfect.
منابع مشابه
Transportation distances and human perception of melodic similarity
• ABSTRACT This article describes how transportation distances such as the Earth Mover’s Distance can be used for measuring melodic similarity for notated music. We represent music notation as weighted point sets in a two-dimensional space of onset time and pitch. The Earth Mover’s Distance can then be used for comparing point sets by determining how much work it would take to convert one of th...
متن کاملSearching musical incipits by means of sequence alignment
Introduction Various methods for symbolic melodic search have been proposed, from string matching based approaches to geometric models [3]. A study on melodies in the Dutch Song Database (www.liederenbank.nl) has shown that sequence alignment methods are quite powerful at dealing with melodic variability [2]. We investigate the potential of such methods for retrieving RISM incipits, comparing 2...
متن کاملAutomatic Tune Family Identification by Musical Sequence Alignment
Musics, like languages and genes, evolve through a process of transmission, variation, and selection. Evolution of musical tune families has been studied qualitatively for over a century, but quantitative analysis has been hampered by an inability to objectively distinguish between musical similarities that are due to chance and those that are due to descent from a common ancestor. Here we prop...
متن کاملThe Temperament Police: The Truth, the Ground Truth, and Nothing but the Truth
The tuning system of a keyboard instrument is chosen so that frequently used musical intervals sound as consonant as possible. Temperament refers to the compromise arising from the fact that not all intervals can be maximally consonant simultaneously. Recent work showed that it is possible to estimate temperament from audio recordings with no prior knowledge of the musical score, using a conser...
متن کاملAutomatic Recognition of Samples in Musical Audio
Sampling can be described as the reuse of a fragment of another artist’s recording in a new musical work. This project aims at developing an algorithm that, given a database of candidate recordings, can detect samples of these in a given query. The problem of sample identification as a music information retrieval task has not been addressed before, it is therefore first defined and situated in ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- JDIM
دوره 3 شماره
صفحات -
تاریخ انتشار 2005